Understanding the formation of subjective human traits, such as preferences and opinions, is an 
important but poorly explored problem. An essential aspect is that traits collectively evolve under 
the repeated action of social influence interactions, which is the focus of many quantitative studies 
of cultural dynamics. In this paradigm, dynamical models require that all traits are fixed when 
specifying the “initial cultural state”. Typically, this initial state is randomly generated, from a 
uniform distribution over the set of possible combinations of traits. However, recent work has 
shown that the outcome of social influence dynamics strongly depends on the nature of the initial 
state: if this is sampled from empirical data instead of being generated in a uniformly random 
way, a higher level of cultural diversity is found after long-term dynamics, for the same level of 
propensity towards collective behavior in the short-term; moreover, if the initial state is obtained 
by shuffling the empirical traits among people, the level of long-term cultural diversity is in-between 
those obtained for the empirical and random counterparts. The current study repeats the analysis 
for multiple empirical data sets, showing that the results are remarkably similar, although the matrix 
of correlations between cultural variables clearly differs across data sets. This points towards robust 
structural properties inherent in empirical cultural states, likely due to universal laws governing the 
dynamics of culture in the real world. The analysis suggests, first, that this dynamics operates close 
to criticality and second, that it is driven by more than just social influence, implications which 
were not recognized previously. 


I. INTRODUCTION 

A fundamental challenge for the social sciences consists of understanding the formation (or
origin) of preferences, opinions, values and beliefs - in short, of “cultural traits”. This is
intimately linked with understanding human decision making, thus having strong relevance for
disciplines like sociology, economics or political science. As will become more apparent below,
trait formation also has implications for the fate of cultural diversity and for collective human
behavior, problems that have recently attracted much attention from natural scientists. However,
the problem of cultural trait formation appears to be poorly explored. Almost thirty years ago,
Ref. [1] argued that questions of how preferences form tend to either be avoided or receive
answers that are only valid for certain cultures or societies. Currently, some efforts are being
made to address such questions GHU, driven by the economic relevance of preferences. Arguably, a
complete answer to the problem of trait formation can only be given by a general, formal theory
of trait dynamics, which would account for both the dynamical nature of culture (the fact that
traits evolve with time) and the multidimensional nature of culture (the fact that, at the level
of every individual, cultural traits cannot exist independently of other cultural traits).

Formulating and validating such a theory is beyond the purpose of this study, although this work
does provide certain guidelines in this direction. Nonetheless, it is instructive to reflect on
the types of processes that conceivably drive the dynamics of cultural traits in the real world,
which can be seen as potential ingredients of a general theory. First, a change of an individual
trait may happen by copying the trait of another individual as a result of an interaction - a
“social influence” mechanism. Second, a change of an individual trait may happen when real-world
experience contradicts an upheld personal view of a certain issue - a “reality-based consistency”
mechanism. Third, a change may be a consequence of the person reconsidering his/her position with
respect to a certain issue, due to some inconsistency or tension with his/her position on some
other issue - a “cross-topic consistency” mechanism. Most likely, there are other relevant
ingredients that one may think of, likely having to do with how the human mind works, but these
are enough for illustrating the complexity of the problem of trait formation. These mechanisms
would manifest in parallel and would be coupled to each other at the level of every individual.
For the rest of this manuscript, it is convenient to use the term “exogenous dynamics” to refer
to trait dynamics driven by social influence. By contrast, “endogenous dynamics” is used to
jointly refer to all other, potentially relevant ingredients which may be defined independently
of social influence, such as the reality-based and cross-topic consistency mechanisms mentioned
above. This “endogenous-exogenous” distinction should not be confused with that made in Ref. [2],
which uses the same terminology for a different purpose, although within a similar context.

The exogenous dynamics of traits has recently received much attention from a highly
interdisciplinary research community, with significant input from physics [5], which led to the
development of various models and quantitative tools. In addition to embracing the dynamical
nature of traits, this approach often also embraces the multidimensional nature of culture, in
which case it is referred to as “cultural dynamics” - in contrast to similar research focusing on
single-dimensional dynamics, which is referred to as “opinion dynamics”. This work mainly builds
on previous work in “cultural dynamics”, for which the so-called Axelrod model [6] is very
representative. In this paradigm, an individual is understood as a long sequence of cultural
traits, or a “cultural vector”, which evolves in time. Every entry of the vector corresponds to
one dimension of culture, also referred to as a cultural variable or “cultural feature”. There
are many versions of the Axelrod model present in the literature, each incorporating a different
mix of ingredients along with social influence mm. As far as the authors are aware, none of these
versions includes endogenous processes in its dynamical rules - but see the discussion concerning
“cultural drift” in Sec. V A. The need for bringing together exogenous and endogenous ingredients
has also been recently signaled by Ref. [16] (using a different terminology), which also proposes
a first model that achieves such an integration. Unlike Ref. [16], the present work does not
attempt an exogenous-endogenous integration on the theory side. Instead, it studies the joint
manifestation of endogenous and exogenous processes in available empirical data, which consists
of partial snapshots of the real-world dynamics, as explained below.

One might argue that the effects of endogenous processes are actually accounted for when
specifying the initial conditions of exogenous dynamics models, namely the set of initial
cultural vectors - the “initial cultural state”. However, the initial conditions are usually
specified randomly, using a uniform probability distribution over the set of possible cultural
vectors - a uniform distribution in “cultural space”. As an exception, Ref. m uses a non-uniform
distribution, but one that can be factorized as a product of Poisson, feature-level
distributions, which does not entail any correlations. An apparent, a-posteriori justification of
the uniform approach is that the main outcomes of exogenous dynamics seem to be largely
independent of the specific realization of the initial cultural state, but dependent on the
dynamical rules. However, this is tautological, as it is only valid for the class of cultural
states that are representative of the uniform probability distribution. Moreover, it is difficult
to believe that uniform randomness is a good description of the outcome of realistic endogenous
processes - either alone or coupled to exogenous processes taking place “before the initial
condition”.

In fact, Refs. m and Pj3] showed that the initial conditions actually contribute strongly to
determining the outcome of exogenous dynamics: an initial cultural state constructed from
empirical data seems to induce significant deviations with respect to the uniformly random case.
On the one hand, this suggests that there is much to be understood about the role of endogenous
processes in preference formation and its interplay with exogenous processes. On the other hand,
this implies that exogenous dynamics models can be used for probing the structure of the initial
cultural state. Thus, if the cultural vectors in the initial state correspond to real
individuals, the outcome of exogenous dynamics can be used as a quantitative tool for gaining
insight about how real individuals are distributed in cultural space, and indirectly about trait
formation in the real world. This is, to a great extent, the direction of the research presented
here.

A set of cultural vectors constructed from empirical data, also referred to below as an
“empirical cultural state”, is how the notion of “culture” is formalized in this study. It should
be understood as a partial snapshot of cultural dynamics taking place in the real world: the set
of cultural vectors corresponds to a random subset of individuals from a society, whose cultural
traits are recorded within a relatively short period of time - typically via a social survey.
Obviously, an empirical cultural state encodes all effects of trait formation - thus of both
endogenous and exogenous processes - taking place in that social system before the snapshot. This
is implicit in all the analysis and modeling of empirical cultural states done throughout this
study. Separating the effects of endogenous processes from those of exogenous processes is
inherently difficult¹ and is not attempted at this point.

It is desirable that knowledge gained about trait formation in the real world has a “universal”
nature, namely that the knowledge is valid across societies and across time. Consequently,
studying the empirical cultural state is done with the aim of unraveling universal structural
properties that are independent of the particular socio-cultural system from which the data is
obtained. These properties could then be interpreted as signatures of general laws governing
trait formation and used to indirectly achieve knowledge about these laws. The need for finding
such generic properties, if they exist, is one of the essential motivations for a
physics-inspired research approach. One can indeed wonder whether, just like uniformly random
initial conditions lead to certain universal outcomes, different classes of initial conditions,
in particular those constructed from empirical data, lead to different, but still universal
outcomes. In other words, one would like to identify universality classes of cultural states, and
in particular check whether empirical states associated with different data sources fall within
the same universality class. The current work does not yet provide a rigorous way of identifying
these universality classes in a statistical physics sense. However, it does emphasize remarkable
statistical similarities between different data sets. These similarities suggest that different
empirical cultural states might indeed belong to the same universality class. Formally, this is
done using a certain mathematical tool which actually relies on a model of exogenous dynamics.

1 Even the actual person might often not be able to tell what exactly caused a specific change of
one of his/her traits, or whether it occurred “exogenously” or “endogenously” - in most cases
both aspects would likely contribute in an intricate way, while parts of the combined process
would take place subconsciously.

This tool, inspired by Ref. m, is effectively a 2-dimensional plot which shows, for any given
initial cultural state, how a measure of long-term cultural diversity (LTCD) depends on a measure
of short-term collective behavior (STCB) - or short-term social coordination. The LTCD quantity
estimates the extent to which cultural diversity survives after a long period of exogenous trait
dynamics governed by consensus-favoring social influence, in the absence of any other process.
This is done using an Axelrod-type model [B] of trait dynamics. The STCB quantity estimates the
propensity of the initial state to collective behavior in the short term. This is done using a
modification of the Cont-Bouchaud model [20] of social coordination. While LTCD is a property of
the final state, STCB is a property of the initial state of the exogenous dynamics model.
However, the definition of the STCB does employ, in a more implicit way, the idea of
single-opinion dynamics driven by social influence, but taking place on a much shorter
time-scale. In fact, as described in Sec. III, both the LTCD and the STCB quantities are
functions of the same free parameter, the bounded confidence threshold ω, which controls the
maximal distance in cultural space for which social influence can operate. The common dependence
on this parameter is what allows for LTCD to be expressed as a function of STCB.
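
The parametric construction described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `ltcd_fn` and `stcb_fn` are hypothetical placeholders for any two functions that return the LTCD and the STCB of a fixed set of cultural vectors at a given threshold, and the toy functions in the usage example are not taken from the study.

```python
def ltcd_vs_stcb(ltcd_fn, stcb_fn, omegas):
    """Pair LTCD and STCB values that share the same threshold omega.

    ltcd_fn, stcb_fn : hypothetical callables mapping omega to a number
    omegas           : iterable of threshold values to scan
    Returns a list of (STCB, LTCD) points, sorted by STCB.
    """
    points = [(stcb_fn(omega), ltcd_fn(omega)) for omega in omegas]
    # Sorting by STCB lets the points be read as a curve LTCD(STCB).
    return sorted(points)


if __name__ == "__main__":
    # Toy stand-ins (assumptions for illustration only).
    curve = ltcd_vs_stcb(lambda w: 1.0 - w, lambda w: w, [0.5, 0.0, 1.0])
    for stcb_value, ltcd_value in curve:
        print(stcb_value, ltcd_value)
```

The threshold itself drops out of the result: each point on the curve only records which LTCD value goes with which STCB value.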


Ref. [18] showed that STCB and LTCD are mutually exclusive when the initial cultural state is
generated in a uniformly random way. In other words, for uniformly random data, either the STCB
or the LTCD attains a close-to-minimal value for any value of the bounded confidence threshold,
apparently calling for a more complicated description or otherwise suggesting a paradox, since
real-world societies seem to allow for both short-term collective behavior and long-term cultural
diversity. However, when the initial cultural state is constructed from empirical data, the two
aspects become clearly more compatible, with both quantities attaining intermediate values for a
certain region of the threshold. This appears to be a parsimonious way of bringing about the
expected coexistence of LTCD and STCB. Besides the uniformly random and empirical cultural
states, a third class of initial conditions can be constructed by shuffling the empirical traits
among vectors, thus retaining only part of the information in the empirical data. This third
class was found to entail a level of compatibility of LTCD and STCB which is intermediate between
those obtained with empirical and uniformly random data. The current study is dedicated to
checking the robustness of the LTCD-STCB behavior across different empirical data sets. As shown
below, this behavior is indeed universal.


The rest of this manuscript is organized as follows. Sec. II rigorously explains the notion of
“cultural state”, starting from the related concepts of “cultural space”, “cultural feature”,
“cultural trait”, “cultural distance” and “cultural vector”. It also explains what it means to
“shuffle” and to “randomize” a set of cultural vectors (SCV). Then, Sec. III describes two
important quantities, associated with the notions of “long-term cultural diversity” (LTCD) and
“short-term collective behavior” (STCB), as well as how these are combined to provide a
quantitative tool that can emphasize the structure implicit in an empirical cultural state [18].
Finally, Sec. IV shows that this structure is robust across data sets and across geographical
regions. Sec. V gives detailed discussions of results presented throughout the study, of possible
criticism and of questions that can be further investigated. The manuscript is concluded in
Sec. VI. Note that, although most notions explained in Sec. II and Sec. III have already been
used in previous work, they are worth explaining at length here, due to their systematic usage
with multiple data sets, as well as for clarifying various aspects that were previously only
implicit.


II. THE FORMAL REPRESENTATION OF 
CULTURE 


To understand how a cultural state is formally encoded here, it is worth starting with the notion
of “cultural space”. This originates from the development and study of models of exogenous trait
dynamics, in particular of Axelrod-type models [B]. In this paradigm, one deals with a set of
variables, called “cultural features”, which encode information about various properties that
individuals can have, properties that are inherently subjective and that can change under the
action of “social influence” arising during person-to-person interactions. By construction, these
variables are allowed to attain only specific values, which are here called “cultural traits”.
The interpretation here is that cultural traits encode “preferences”, “opinions”, “values” and
“beliefs” that people can have on various topics, where each topic is associated with one
feature.

A cultural space consists of the set of all possible combinations of cultural traits entailed by
the set of chosen cultural features, together with a measure of dissimilarity between any two
combinations. Moreover, this dissimilarity, also called the “cultural distance”, is defined in
such a way that it satisfies all the properties of a metric distance (non-negativity, identity of
indiscernibles, symmetry and triangle inequality). The so-called “Hamming” distance is commonly
employed for this purpose, which is meaningful as long as there is no obvious ordering of the
traits of any feature. A cultural space is thus an abstract, discrete, metric space, where each
point corresponds to a specific combination of traits. However, the cultural space is
mathematically not a vector space, since there is no notion of additivity attached to it.

A cultural state is essentially the selection of points in the cultural space that needs to be
specified for the initial state of models of exogenous dynamics. Such a selection is also
referred to here as a “set of cultural vectors” (SCV), where one “cultural vector” is one
possible combination of traits. Formally, this is not a set in the rigorous sense, but a
multiset, since it may contain duplicate elements - identical sequences of traits. However,
duplicate elements will rarely occur in the initial states constructed for this study, since the
number of cultural vectors is in practice much smaller than the number of possible points in the
cultural space. Throughout this study, the notion of SCV is used interchangeably with that of
“cultural state”.

At this point, it may also be convenient to reflect on the more abstract notion of “cultural
space distribution” (CSD), as a discrete probability mass function taking the cultural space as
its support. If the SCV is constructed in a uniformly random way, one implicitly assumes that the
cultural space distribution is constant, namely that all combinations of traits are equally
likely. If, however, the SCV is constructed from empirical data, the inherent structure may be
thought to correspond to non-homogeneities in an underlying CSD, of which the data is
representative. Explicitly measuring or working with such an object is impractical, at least for
the present study, mainly due to the large number of points, the high a-priori dimensionality
characterizing a typical cultural space and the sparseness of the sampling associated with an
empirical set of cultural vectors. Nonetheless, the CSD, as a concept, may be useful, for
instance when developing stochastic models for generating realistic SCVs - some steps in this
direction have been taken in Ref. m.


Of great importance for this study are the empirical SCVs, which are mainly constructed from
social survey data. Cultural features are obtained from the questions that are asked in the
survey, where the traits of each feature correspond to the answers that can be given to the
respective question. Thus, a cultural vector consists of a sequence of answers that one
individual has given to the list of questions in the survey. Importantly, a question is selected
and encoded as a feature only if it is reasonably subjective, meaning that it does not ask about
demographic or physical aspects concerning the individual, like place of residence, marital
status, age or weight, and that every allowed answer should be plausible at least from a certain
perspective of looking at the question, or for people with a certain background or a certain way
of thinking. Moreover, a question is disregarded if the survey is defined in such a way that its
list of a-priori allowed answers depends on what answers are given to other questions. All
features remaining after this filtering - more details are provided in the Appendix - are assumed
to contribute equally to the cultural distance, but the way they contribute depends on whether
they are treated as nominal or as ordinal variables. Specifically, the cultural distance $d_{ij}$
between two vectors i and j is computed according to:


\[
d_{ij} = \frac{1}{F} \sum_{k=1}^{F} \left[ f^k_{nom} \left( 1 - \delta(x^k_i, x^k_j) \right)
+ \left( 1 - f^k_{nom} \right) \frac{|x^k_i - x^k_j|}{q^k - 1} \right]
= \frac{1}{F} \sum_{k=1}^{F} d^k_{ij}, \qquad (1)
\]

where F is the number of cultural features with k iterating over them, $f^k_{nom}$ is a binary
variable encoding the type of feature k (1 for nominal and 0 for ordinal), $q^k$ is the range
(number of traits) of feature k, $\delta(a,b)$ is a Kronecker delta function of traits a and b
(of the same feature) and $x^k_i$ is the trait of cultural vector i with respect to feature k.
This definition reduces to the Hamming distance in case there are only nominal variables present.
The second equality sign gives a formulation of the cultural distance as a sum over feature-level
cultural distance contributions $d^k_{ij}/F$.

These feature-level contributions allow one to formulate, following Ref. [15], a notion of
feature-feature covariance:

\[
\sigma^{k,l} = \left\langle d^k_{ij} d^l_{ij} \right\rangle_{i<j}
- \left\langle d^k_{ij} \right\rangle_{i<j} \left\langle d^l_{ij} \right\rangle_{i<j}, \qquad (2)
\]

valid for any two features k and l, regardless of $f^k_{nom}$ and $f^l_{nom}$. Note that the
averaging is performed over all N(N − 1)/2 distinct pairs of cultural vectors, rather than over
all N cultural vectors. The feature-feature covariances can be used to define the
feature-feature (Pearson) correlation:

\[
\rho^{k,l} = \frac{\sigma^{k,l}}{\sqrt{\sigma^{k,k} \sigma^{l,l}}}, \qquad (3)
\]

which measures the extent to which large/small distances in terms of feature k are associated
with large/small distances in terms of feature l. One can see the F×F correlation matrix ρ as a
reflection of a CSD that is compatible with the data. In general, however, the correlation matrix
will only retain part of the information encoded in the CSD, first because $\rho^{k,l}$ retains
only part of the information in the 2-dimensional contingency table of features k and l, second
because a CSD is essentially an F-dimensional contingency table, which might imply all kinds of
higher-order correlations.
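
The pair-based averaging can be made concrete with a short Python sketch. It is an illustration only, assuming integer-encoded traits, the covariance written as the pair average of the product minus the product of the pair averages, and the same feature-level distance as assumed for the cultural distance (1 or 0 for nominal features, normalized absolute difference for ordinal ones). The function names are hypothetical.

```python
from itertools import combinations
from math import sqrt


def feature_distance(a, b, nominal, q):
    """Distance between two traits of one feature (assumed form)."""
    return float(a != b) if nominal else abs(a - b) / (q - 1)


def feature_correlation(scv, k, l, is_nominal, ranges):
    """Pearson correlation between the distances of features k and l.

    Averages run over all N(N-1)/2 distinct pairs of cultural vectors,
    not over the N vectors themselves.
    """
    pairs = list(combinations(range(len(scv)), 2))
    d_k = [feature_distance(scv[i][k], scv[j][k], is_nominal[k], ranges[k])
           for i, j in pairs]
    d_l = [feature_distance(scv[i][l], scv[j][l], is_nominal[l], ranges[l])
           for i, j in pairs]

    def covariance(u, v):
        n = len(u)
        mean_u, mean_v = sum(u) / n, sum(v) / n
        return sum(a * b for a, b in zip(u, v)) / n - mean_u * mean_v

    # Undefined (division by zero) if a feature has the same distance for all pairs.
    return covariance(d_k, d_l) / sqrt(covariance(d_k, d_k) * covariance(d_l, d_l))


if __name__ == "__main__":
    scv = [(0, 0, 0), (0, 0, 1), (1, 1, 0), (1, 1, 1)]
    print(feature_correlation(scv, 0, 1, [True] * 3, [2] * 3))
```

In the usage example the first two features always coincide, so their pairwise distances are perfectly correlated.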

Assuming the definition of cultural distance given by Eq. (1), a cultural space is already
specified by the list of features taken from an empirical data set, together with the associated
ranges and types. In this empirically-defined cultural space, it is meaningful to talk about
several types of SCVs. First, an empirical SCV is constructed from the empirical sequences of
traits of the individuals selected from those sampled by the survey. Second, a shuffled SCV is
constructed by randomly permuting the empirical traits among individuals, independently for every
feature. Third, a random SCV is constructed by randomly choosing the trait of every person, for
every feature. Note that the shuffled SCV exactly reproduces, for each feature, the empirical
frequency of each trait, while disregarding all information about the frequencies of
co-occurrence of various combinations of traits of two or more different features. Thus,
shuffling destroys all feature-feature correlations $\rho^{k,l}$, as well as any higher-order
correlations entailed by the empirical SCV, retaining only the information encoded in the
marginal probability distributions associated with individual features. On the other hand, a
random SCV retains nothing of the information inherent in the empirical SCV.
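
The two surrogate constructions can be sketched as follows. This is an illustrative Python sketch under simple assumptions: an SCV is a list of equal-length tuples of integer traits, and a random SCV draws each trait uniformly from 0, ..., q − 1. The function names and the toy data are hypothetical.

```python
import random


def shuffled_scv(scv, rng):
    """Permute the traits among individuals, independently for every feature."""
    n_features = len(scv[0])
    columns = []
    for k in range(n_features):
        column = [vector[k] for vector in scv]
        rng.shuffle(column)  # keeps the trait frequencies of feature k intact
        columns.append(column)
    return [tuple(column[i] for column in columns) for i in range(len(scv))]


def random_scv(n_vectors, ranges, rng):
    """Draw every trait uniformly at random from the range of its feature."""
    return [tuple(rng.randrange(q) for q in ranges) for _ in range(n_vectors)]


if __name__ == "__main__":
    rng = random.Random(0)
    empirical = [(0, 0), (0, 0), (1, 1), (2, 1)]  # toy data, not from a survey
    print(shuffled_scv(empirical, rng))
    print(random_scv(4, [3, 2], rng))
```

Because each feature is permuted on its own, the per-feature trait counts of the shuffled SCV match those of the input exactly, while co-occurrences across features are randomized.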

Finally, note that the mathematical definition of cultural distance given by Eq. (1), already
used in Refs. [18] and [15], is neither unique nor very sophisticated. Other definitions might
capture differences in opinions, preferences, values, beliefs, attitudes and associated behavior
tendencies in better, more precise ways - see Ref. ED for a sophisticated approach. However, the
current definition is arguably good enough for the problems explored in this study and for how
they are attacked. Since this research approach - using exogenous trait dynamics for probing the
structure of cultural states - is only in its infancy, keeping all quantitative tools and models
as simple as possible is desirable. Moreover, the current definition of cultural distance
constitutes a good bridge between exogenous dynamics models and available social survey data.


III. LONG-TERM CULTURAL DIVERSITY 
AND SHORT-TERM COLLECTIVE BEHAVIOR 

This section focuses on two quantities that are evaluated on sets of cultural vectors, namely the
LTCD and STCB quantities mentioned above, which are defined in the same way as in Ref. [18]. Each
of these quantities is based, explicitly or implicitly, on the idea of trait dynamics driven by
social influence in a population of interacting agents. Each agent is associated, for the initial
conditions, with one of the cultural vectors in the SCV that is studied. For simplicity, both
quantities assume that there is neither a physical space nor a social network that would
constrain the interactions between agents. In both cases, the interactions are assumed to only be
constrained by how the agents are distributed in the cultural space. Specifically, only if the
distance between two cultural vectors is smaller than the bounded confidence threshold ω are the
two agents able to influence each other’s opinions in favor of local consensus. This effectively
says that there needs to be enough similarity between the cultural traits of two people if either
of them is to convince the other of anything. This picture is inspired by assimilation-contrast
theory [22], Ref. [10] being the first study that explicitly uses the bounded-confidence
threshold in the context of trait dynamics, after it had already been in use in the context of
opinion dynamics for some time - see Ref. [23] for an overview. The bounded confidence threshold
ω functions like a free parameter on which both the LTCD and the STCB quantities depend, for any
given SCV.

The LTCD quantity is a measure of the extent to which the given SCV favors cultural diversity in
the long term, namely a survival of differences in cultural traits at the macro level, in spite
of repeated, consensus-favoring interactions at the micro level. In the real world, boundaries
between populations belonging to different cultures appear to be resilient with respect to social
interactions across them EH - ED]. The measure relies on an Axelrod-type model jjjj of cultural
evolution with bounded confidence, which is applied to the SCV. This is meant to computationally
simulate the evolution of cultural traits under the action of dyadic social influence, in the
absence of other processes that would be present in reality. According to this model, at each
moment in time, two agents are randomly chosen for an interaction. If the distance between their
cultural vectors is smaller than the threshold ω, then, with a probability proportional to
1 − ω, for one of the features that distinguishes between the two vectors, one of the agents
changes its trait to match the other. With time, agents become more similar to those that are
within a distance ω in the cultural space. The dynamics stops when several groups are formed,
within which agents are completely identical to each other, but too dissimilar to those in other
groups for any trait-changing interaction to occur. These groups are called “cultural domains”, a
term formulated in the context of the original Axelrod model [5j, which also included a
physical/geographical, 2-dimensional lattice but no (explicit) bounded confidence threshold. The
normalized number of such cultural domains for a given value of ω, averaged over multiple runs of
the model, defines the LTCD quantity:

\[
\mathrm{LTCD}(\omega) = \frac{\langle N_D \rangle_\omega}{N}, \qquad (4)
\]

where $N_D$ is the number of cultural domains in the final (or absorbing) state of this model,
the normalization being made with respect to N, the size of the SCV.
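
A simplified simulation of this kind of dynamics can be sketched in Python. This is a sketch under stated assumptions and not the model implementation used in the study: traits are integers, the distance uses the same assumed form as in the earlier discussion of cultural distance, the interacting pair is drawn directly among pairs that can still interact, and the constant interaction probability is omitted on the assumption that it only rescales time. The function names, the `max_steps` guard and the toy data are hypothetical.

```python
import random
from itertools import combinations


def distance(x_i, x_j, is_nominal, ranges):
    """Cultural distance; the ordinal normalization by (q - 1) is an assumption."""
    total = 0.0
    for a, b, nominal, q in zip(x_i, x_j, is_nominal, ranges):
        total += float(a != b) if nominal else abs(a - b) / (q - 1)
    return total / len(ranges)


def ltcd_single_run(scv, omega, is_nominal, ranges, rng, max_steps=100_000):
    """Normalized number of distinct vectors left after one run of the dynamics."""
    agents = [list(vector) for vector in scv]
    for _ in range(max_steps):
        # Pairs that can still interact: different, but closer than omega.
        active = [
            (i, j)
            for i, j in combinations(range(len(agents)), 2)
            if 0.0 < distance(agents[i], agents[j], is_nominal, ranges) < omega
        ]
        if not active:
            break  # absorbing state: no trait-changing interaction is possible
        i, j = rng.choice(active)
        if rng.random() < 0.5:
            i, j = j, i  # choose at random which agent of the pair is influenced
        differing = [k for k in range(len(ranges)) if agents[i][k] != agents[j][k]]
        k = rng.choice(differing)
        agents[i][k] = agents[j][k]  # agent i copies one differing trait of agent j
    # In an absorbing state every distinct vector is one cultural domain.
    return len({tuple(agent) for agent in agents}) / len(agents)


def ltcd(scv, omega, is_nominal, ranges, runs=10, seed=0):
    """Average of the single-run result over several independent runs."""
    rng = random.Random(seed)
    results = [ltcd_single_run(scv, omega, is_nominal, ranges, rng) for _ in range(runs)]
    return sum(results) / runs


if __name__ == "__main__":
    toy = [(0, 0, 0), (1, 1, 1), (2, 2, 2), (0, 1, 2), (2, 1, 0), (1, 0, 2)]
    for omega in (0.0, 0.5, 1.01):
        print(omega, ltcd(toy, omega, [True] * 3, [3] * 3))
```

Recomputing all active pairs at every step is quadratic in the number of agents, which keeps the sketch short but makes it suitable only for small sets of cultural vectors.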

The STCB quantity is a measure of the extent to which the given SCV favors collective behavior
(or social coordination) in the short term, namely the extent to which the agents associated with
the cultural vectors in the set would tend to take actions or make choices in a similar,
coordinated way rather than independently from each other. Bursts of fashion and popularity
[271125], rapid diffusion of rumors, gossip and habits [5] HD] and speculative bubbles and
herding behavior on the stock markets EDI ED are real-world examples of collective behavior in
the short term. The measure relies on a Cont-Bouchaud type model [20], which deals with an
aggregate choice or opinion of the entire agent population on one issue, which for simplicity is
assumed here to be represented by a binary variable, such as liking vs disliking an item.
According to the model, when collectively confronted with this issue, the agents within a
connected group effectively make the same choice or express the same opinion. In this context
(where physical space and social network are disregarded), a connected group is a subset of
agents that form a connected component in the graph obtained by introducing a link for every pair
(i, j) of agents that are culturally close enough to socially influence each other,
$d_{ij} < \omega$. Based on this approximation, the aggregate, normalized choice of the entire
population is expressed as a weighted average over the choices of the connected components, where
the weight of the A’th component is the size $S_A$ of this component. However, the group choices
themselves are still assumed to be binary, equiprobable random variables with values {−1, +1}.
Thus, the aggregate, normalized choice is also a random variable, but one that is non-uniformly
distributed over some set of rational numbers within [−1, 1], in a manner that depends on the set
of group sizes $\{S_A\}_\omega$ induced by a specific value of the ω threshold. The spread of
this aggregate probability distribution provides the coordination measure that defines the STCB.
It turns out that this quantity can be analytically computed, for a given ω, according to [18]:


\[
\mathrm{STCB}(\omega) = \sqrt{\sum_{A} \left( \frac{S_A}{N} \right)^2}, \qquad (5)
\]

where the summation is carried over the cultural connected components labeled by different A
values. Note that only the sizes $S_A$ of the components enter the calculation, which are in turn
determined by the cultural graph obtained by applying the ω threshold to the set of pairwise
cultural distances $d_{ij}$ between the cultural vectors. As visible in the formula, the STCB is
higher when the agents are more concentrated in fewer and larger cultural connected components.
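
The computation can be sketched in Python as follows. The sketch assumes that the "spread" is the standard deviation of the aggregate choice, so that for independent, equiprobable ±1 group choices weighted by S_A/N the quantity is the square root of the sum of (S_A/N)² over components. It takes a precomputed matrix of pairwise cultural distances and finds the connected components with a union-find structure. The function name and the toy matrix are hypothetical.

```python
from math import sqrt


def stcb(dist_matrix, omega):
    """Coordination measure from the component sizes of the thresholded graph.

    dist_matrix : symmetric list of lists of pairwise cultural distances
    omega       : bounded confidence threshold; i and j are linked if d_ij < omega
    """
    n = len(dist_matrix)
    parent = list(range(n))

    def find(a):
        # Root of the component containing a, with path halving.
        while parent[a] != a:
            parent[a] = parent[parent[a]]
            a = parent[a]
        return a

    for i in range(n):
        for j in range(i + 1, n):
            if dist_matrix[i][j] < omega:
                parent[find(i)] = find(j)  # merge the two components

    sizes = {}
    for a in range(n):
        root = find(a)
        sizes[root] = sizes.get(root, 0) + 1

    # Assumed form: standard deviation of the size-weighted sum of +/-1 choices.
    return sqrt(sum((size / n) ** 2 for size in sizes.values()))


if __name__ == "__main__":
    # Toy distances: agents 0-1 are close, agents 2-3 are close, the rest are far.
    d = [
        [0.0, 0.1, 0.9, 0.9],
        [0.1, 0.0, 0.9, 0.9],
        [0.9, 0.9, 0.0, 0.1],
        [0.9, 0.9, 0.1, 0.0],
    ]
    for omega in (0.05, 0.5, 1.0):
        print(omega, stcb(d, omega))
```

Under this assumed form the measure ranges from 1/√N, when every agent is isolated, to 1, when all agents belong to a single component.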

There is a crucial difference between the ideas behind
the LTCD and the STCB measures: while the former assumes
that agents move in cultural space under the action of
social influence, the latter assumes that the agents remain
fixed in cultural space while they make their decision on
one issue with which they are just being confronted.
Although, for computing the STCB, the reasoning in terms of
connected cultural components implicitly assumes that
social influence occurs within these components, this
influence is supposedly too superficial and too short-lived
to also alter the cultural vectors themselves. Thus, as
their names suggest, the LTCD and STCB quantities are
concerned with two different time-scales: a long time-scale
for which cultural vectors and distances are dynamic and a
short time-scale for which cultural vectors and distances
are fixed. Moreover, while LTCD requires computer
simulations, the STCB is computed in an analytical way,
although it does require an algorithmic step of finding the
connected components in the cultural graph that is obtained
by thresholding the matrix of cultural distances. Thus,
LTCD can be seen as a characteristic of the final cultural
state resulting from a long, exogenous dynamics process,
while the STCB can be seen as a property of the initial
cultural state. Nonetheless, they both take as input the
set of cultural vectors, and, as shown in Ref. [18], the
combination of the two quantities turns out to be sensitive
to the structure encoded in the set of cultural vectors.


It is worth explicitly illustrating, with Fig. 1, the
behavior of the LTCD and the STCB quantities for a random
SCV. The SCV is defined with respect to the cultural space
of one of the data sets introduced in Sec. IV. Figs. 1(a)
and 1(b) show, respectively, the dependence of the LTCD and
STCB measures on the bounded-confidence threshold $\omega$,
while Fig. 1(c) shows the correspondence between the LTCD
and STCB measures obtained by eliminating $\omega$. The
same data points are used for all 3 plots, where each point
records all the 3 quantities (LTCD, STCB and $\omega$). The
LTCD quantity is averaged, for each point, over 10 runs of
the exogenous dynamics model, with the associated standard
deviations shown by the error bars.


Fig. 1(a) shows that LTCD is decreasing with $\omega$: in
the limit of large N, LTCD goes from 1 to 0 as $\omega$
goes from 0 to 1. This can be understood by keeping in mind
that $\omega$ controls the range of interaction between
agents in the cultural space. In general, convergence of
agents happens in parallel in several regions of the
cultural space, towards several points that are out of
range of each other. The interaction range $\omega$
controls the expected number of such convergence points,
which in turn determines the expected number of cultural
domains in the final state and thus the LTCD value - the
latter three quantities decrease with increasing $\omega$.
If $\omega$ is small enough, there is effectively no
successful interaction and thus no movement in cultural
space, so each agent “converges” to one, distinct point
(assuming that all vectors are different from each other in
the initial state). If $\omega$ is large enough, all agents
tend to converge to the same point in the cultural space.
Note that, in terms of $\omega$, these two extreme cases
are actually two regimes, separated by a sharp decrease of
LTCD over some intermediate $\omega$ interval. This sharp
decrease can actually be understood as an order-disorder
phase transition, where the disordered phase corresponds to
low $\omega$, while the ordered phase corresponds to high
$\omega$.
This transition has been previously studied in the context 
of the Axelrod model [HJ CCO ; although in terms of a dif¬ 
ferently defined control parameter - the (average) feature 
range q rather than the bounded-confidence threshold $\omega$.
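
To make the role of the threshold concrete, the following toy simulation sketches an Axelrod-type dynamics with bounded confidence. It rests on several assumptions that go beyond the text above: all features are nominal, the distance is the fraction of differing features, an allowed interaction succeeds with probability equal to the similarity, the dynamics runs for a fixed number of steps, and LTCD is estimated as the number of distinct final vectors divided by N. The names `hamming` and `ltcd` are hypothetical, and the model actually used for Fig. 1 may differ in these details.

```python
import random


def hamming(u, v):
    """Fraction of features on which two cultural vectors differ (assumed distance)."""
    return sum(a != b for a, b in zip(u, v)) / len(u)


def ltcd(vectors, omega, steps=20000, seed=0):
    """Toy estimate of long-term cultural diversity for one run.

    Assumptions: random pairwise encounters with no spatial or social
    structure, interaction only if 0 < d < omega, success probability 1 - d.
    """
    rng = random.Random(seed)
    state = [list(v) for v in vectors]
    n = len(state)
    for _ in range(steps):
        i, j = rng.sample(range(n), 2)
        d = hamming(state[i], state[j])
        if 0 < d < omega and rng.random() < 1 - d:
            # Agent i copies one trait on which it currently differs from j.
            differing = [k for k in range(len(state[i])) if state[i][k] != state[j][k]]
            k = rng.choice(differing)
            state[i][k] = state[j][k]
    # Assumed diversity measure: distinct final vectors per agent.
    return len({tuple(v) for v in state}) / n


# Hypothetical random initial state: 50 agents, 5 features with 3 traits each.
init_rng = random.Random(42)
agents = [tuple(init_rng.randrange(3) for _ in range(5)) for _ in range(50)]
for omega in (0.0, 0.3, 0.6, 1.0):
    print(omega, ltcd(agents, omega))
```

With a threshold of 0 no interaction is ever allowed, so the diversity stays at its initial value, while larger thresholds let more pairs interact and typically reduce the number of surviving distinct vectors.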


Fig. 1(b) shows that STCB is increasing with $\omega$: in
the limit of large N, STCB goes from 0 to 1 as $\omega$
goes from 0 to 1. This can be understood by keeping in mind
that $\omega$ controls the extent to which agents are
culturally connected to each other. Higher $\omega$ implies
fewer, but larger connected components in the cultural
graph, thus a higher predisposition for coordination. If
$\omega$ is small enough, there is one connected component
for every agent, while if $\omega$ is large enough, there
is one connected component containing all agents. Similarly
to above, these two cases correspond to two regimes
separated by a sharp increase of STCB, which can be again
understood as a phase transition - it is actually a
symmetry breaking phase transition, as explained in Ref.
[18].

FIG. 1. Dependence of long-term cultural diversity and of short-term collective behavior on the bounded confidence threshold
and their incompatibility, for a random set of cultural vectors. Showing the LTCD($\omega$) dependence (a), the STCB($\omega$) dependence
(b) and the $\omega$-induced LTCD-STCB correspondence (c) for a random set of cultural vectors of N = 500, in the cultural space
of the Eurobarometer (EBM) data set (see Sec. IV).


Fig. 1(c) shows that, as $\omega$ increases, one goes from
the upper-left corner (high LTCD, low STCB) to the
lower-right corner (low LTCD, high STCB), by first passing
through the lower-left corner (low LTCD, low STCB). In
other words, the sharp decrease of LTCD happens before the
sharp increase of STCB, meaning that the critical $\omega$
of the LTCD phase transition is lower than that of the STCB
phase transition. This is also visible at a close,
comparative inspection of Figs. 1(a) and 1(b). The
$\omega$-region for which both the LTCD and the STCB attain
low values corresponds to a special situation for which
there is a relatively high level of convergence in the
final cultural state (low LTCD), in spite of a relatively
low level of connectivity in the initial cultural state
(low STCB). This is apparently explained by the fact that
movement in cultural space at a certain moment during the
exogenous dynamics simulation facilitates further movement
that would not have been possible at an earlier moment, so
it is enough to have a few pairs of agents that can
initially influence each other to gradually set a large
fraction of the other agents in motion and in the end
achieve a large amount of convergence. In any case, Fig.
1(c) shows that at least one of the two quantities has to
attain a close-to-minimal value, regardless of the
bounded-confidence threshold $\omega$.


According to the considerations above, long-term cultural
diversity and short-term collective behavior seem to be
mutually exclusive, suggesting a paradox [18], at least if
one accepts that real socio-cultural systems allow for both
aspects. However, the above calculations make use of a
random SCV, meaning that the entire argument relies on the
assumption that the underlying cultural space distribution
is uniform. In fact, Ref. [18] showed that an empirical SCV
allows for much more compatibility, with both quantities
attaining intermediate values for a certain $\omega$
interval - as shown in Sec. IV, this translates to a higher
LTCD-STCB curve than the one shown in Fig. 1(c) - meaning
that the apparent paradox is solved by using realistic data
about cultural traits. Moreover, a shuffled SCV entails a
compatibility level that is in between those entailed by a
random and by an empirical SCV. Thus, Ref. [18] showed that
an empirical SCV has enough structure to dramatically
affect the behavior of exogenous dynamics acting upon it,
an aspect which had been neglected in the past.
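
The two reference ensembles can be illustrated with a short Python sketch. It is only a sketch under stated assumptions: "shuffled" is taken to mean that each feature is permuted independently among the agents (which keeps the trait frequencies of every feature but breaks the associations between features), "random" means that every trait is drawn uniformly, and all features are coded as integers. The function names and the toy data are hypothetical.

```python
import random


def shuffled_scv(vectors, seed=0):
    """Permute each feature (column) independently among agents.

    Per-feature trait frequencies are preserved; associations between
    features are destroyed.
    """
    rng = random.Random(seed)
    columns = [list(col) for col in zip(*vectors)]
    for col in columns:
        rng.shuffle(col)
    return [tuple(row) for row in zip(*columns)]


def random_scv(n_agents, ranges, seed=0):
    """Draw every trait uniformly; ranges[k] is the number of traits of feature k."""
    rng = random.Random(seed)
    return [tuple(rng.randrange(q) for q in ranges) for _ in range(n_agents)]


# Hypothetical "empirical" set: 4 agents, 3 features with 2, 2 and 3 traits.
empirical = [(0, 1, 2), (1, 0, 2), (1, 1, 0), (0, 0, 1)]
print(shuffled_scv(empirical, seed=3))
print(random_scv(len(empirical), [2, 2, 3], seed=3))
```

Both counterparts live in the same cultural space as the empirical set, since the number of features and the range of each feature are kept fixed.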


IV. UNIVERSALITY OF STRUCTURAL PROPERTIES


The findings of Ref. [18] are based on one data set. It
is important to understand whether the observed properties
are in fact robust across different populations and across
different topics. This is accomplished by repeating the
analysis of Ref. [18] on four data sets. These
are taken from different sources, thus containing differ¬ 
ent cultural features and recording the traits of different 
people. The four data sources are: the Eurobarometer 
(EBM), containing opinions on science, technology and 
various European policy issues of people in EU countries 
\-Y2 \ : the General Social Survey (GSS), containing opin¬ 
ions on a great variety of topics of people in the US [55] : 
the Religious Landscape (RL), containing religious be¬ 
liefs and attitudes on certain political issues of people in 
the US [35 ; Jester, containing online ratings of jokes [551 . 

Fig. 2 shows that the properties highlighted by the
LTCD-STCB curves are indeed universal. The 4 panels
correspond to the 4 empirical data sets that are used. In
each panel, the 3 curves correspond to the 3 levels of
preserving the empirical information: full information
(red), corresponding to the empirical SCV; partial
information (blue), corresponding to the shuffled SCV; no
information (black), corresponding to the random SCV. Note
that, for every data set, the empirical SCV allows for more
compatibility between LTCD and STCB than the shuffled SCV,
which in turn allows for more compatibility than the random
SCV. Also note that the empirical LTCD-STCB correspondence
is always close to the second diagonal. These qualitative
observations constitute the basis for the claim that
universal structural properties underlie empirical sets of
cultural vectors.

In relation to aspects discussed at the end of Sec. III,
the rise of the LTCD-STCB curve from the random to the
shuffled and further to the empirical SCV visible in Fig. 2
is related to the LTCD phase transition coming closer to
the STCB phase transition. In terms of $\omega$, for the
random case, the LTCD phase transition is almost over when
the STCB phase transition begins; for the shuffled case
there is more overlap between the high-$\omega$ part of the
former and the low-$\omega$ part of the latter; for the
empirical case there is an almost perfect overlap between
the two. The empirical behavior is illustrated by Fig. 3:
within the $\omega \in [0.2, 0.4]$ interval, the decrease
in LTCD is systematically accompanied by an increase in
STCB. If one accepts that real-world systems are favorable
for both LTCD and STCB and that the respective quantities
used here are defined in a sensible way, this reasoning
suggests that real-world systems function close to
criticality, from the perspective of both measures: only at
criticality or close to it are both quantities allowed to
attain non-vanishing values in the empirical case. In order
to stay away from criticality, the system would need to
abandon either the propensity towards LTCD or the
propensity towards STCB. This suggests, as a speculation or
conjecture, that the concept of self-organized criticality
[36] might actually play an important role in describing
trait formation and the interplay between exogenous and
endogenous dynamics of traits. If this is correct, then a
complete theory of cultural dynamics should have no need of
fine-tuning the $\omega$ parameter.

Another important aspect is the robustness of the
LTCD-STCB curves of Fig. 2 when switching from one
geographical region to another, which is illustrated here
by Fig. 4. This is done by focusing on the two data sets
which allow for division of the sample in terms of
geographical regions, namely the Eurobarometer and the
Religious Landscape. Moreover, only the nominal-variable
information in the Eurobarometer is used, for reducing the
computational time required by the exogenous dynamics
model, as well as for illustrating the robustness of the
results with respect to the sample of cultural variables
that are used. The empirical and shuffled LTCD-STCB curves
are shown for 5 EU countries (left) and for 5 US states
(right), respectively. Only one random curve is shown,
because, for a specific data set, the country/state-level
SCVs are defined with respect to the same cultural space,
which is fully determined by the types and ranges of
variables in the empirical data, which are the same
regardless of the sample of people. Note that, for both
data sets, the empirical and shuffled curves fall into
clearly distinguishable bands. The empirical curves are
systematically above the shuffled ones, while being again
close to the second diagonal. This confirms the
geographical universality of the structural properties
inherent in empirical data.

When confronted with these results, one thinks of
unavoidable similarities between questions in the survey,
which induce correlations between cultural features. Since
these correlations are destroyed by the shuffling
procedure, it is tempting to invoke them as an explanation
for the discrepancy between an empirical LTCD-STCB curve
and its shuffled counterpart. However, there is no reason
to believe that such similarities are equally present in
different empirical data sets, or that they are
similarly distributed among the pairs of questions in the
data set, since different data sets rely on completely dif¬ 
ferent sets of variables. In fact, the measured feature- 
feature correlations p k ’ 1 , defined via Eq. ^ are quite 
different across the four data sets used here. This is 
illustrated by Fig. 5, which shows how the values of these
correlations are distributed for the different empirical
SCVs (left), while also showing, for comparison, the
distributions for their shuffled counterparts (right),
which, as expected, are strongly peaked around 0 (the
empirical and shuffled correlation matrices are shown in
Figs. 6 and 7 of the Appendix). The departure of the
empirical distribution from its shuffled counterpart is
clearly different across data sets, whereas the departure
of the empirical LTCD-STCB curve from its shuffled
counterpart is very similar across data sets, as shown in
Fig. 2. Moreover, feature-feature correlations are
typically small, given that any such correlation can take
values within the [-1, 1] interval. These are indications
that the properties captured by the LTCD-STCB plot are not
(or not exclusively) due to feature-feature correlations,
and that additional information destroyed by shuffling
(including higher-order correlations) plays an important
role. Such considerations reinforce the belief that the
observed properties are due to a more subtle, dynamical and
universal mechanism.
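
The comparison between empirical and shuffled correlation distributions can be illustrated as follows. This is a hedged sketch, not the measurement used in the paper: it assumes that features are numerically coded and that the feature-feature correlation is the plain Pearson coefficient, whereas the definition actually used for the figures may treat nominal and ordinal features differently. The synthetic data set and all names are hypothetical.

```python
import random


def pearson(x, y):
    """Pearson correlation of two equally long numeric sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / (vx * vy) ** 0.5


def feature_correlations(vectors):
    """Correlations for all distinct feature pairs (k, l) with k < l."""
    cols = list(zip(*vectors))
    return [pearson(cols[k], cols[l])
            for k in range(len(cols)) for l in range(k + 1, len(cols))]


rng = random.Random(1)
# Hypothetical data: feature 1 copies feature 0 for about 80% of the agents,
# feature 2 is independent of both.
empirical = []
for _ in range(2000):
    a = rng.randrange(2)
    b = a if rng.random() < 0.8 else rng.randrange(2)
    empirical.append((a, b, rng.randrange(2)))

# Shuffled counterpart: permute each feature independently among agents.
cols = [list(c) for c in zip(*empirical)]
for c in cols:
    rng.shuffle(c)
shuffled = list(zip(*cols))

emp_corr = feature_correlations(empirical)
shuf_corr = feature_correlations(shuffled)
print([round(r, 2) for r in emp_corr])
print([round(r, 2) for r in shuf_corr])
```

Only pairwise associations are visible in such a list; any higher-order structure removed by shuffling would not show up in it, which is the point made in the paragraph above.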


V. DISCUSSION 

This section discusses at length various points that
have been left open throughout the study, while mentioning
possible directions for future research. First, Sec. V A
further clarifies, at a rather philosophical level, the
greater context and purpose of this research, while
elaborating on concepts presented at the beginning of Sec.
I, mainly revolving around the contrast between endogenous
and exogenous aspects of trait formation. Second, Sec. V B
elaborates on certain assumptions, implications and
possible weaknesses of the analysis conducted here.

A. Endogenous vs exogenous trait dynamics


In order to explain the problem of cultural trait
formation and how this relates to the existing cultural
dynamics paradigm, it was useful to distinguish, already at
the beginning of Sec. I, between exogenous and endogenous
mechanisms of trait dynamics. As explained, only the
exogenous contribution is usually considered within the
cultural dynamics paradigm, on which this work is largely
based. In this sense, the focus is on social influence
interactions taking place between agents, which in turn are
only capable of changing their traits in response to the
interactions. This type of description is attractive from a
physics and a complex systems perspective, as it formulates
macroscopic behavior in terms of interactions between a
large number of microscopic constituents. Moreover, this
approach also seems promising from a social science
perspective, as it embraces both the dynamical nature and
the multidimensional nature of culture, thus providing the
ground on which a general theory of trait dynamics might
one day be built, a theory which in principle would
constitute a solution for the problem of preference
formation. However, a complete theory should also account
for endogenous processes, namely those that can take place
in the minds of individuals even in the absence of social
influence interactions. Formally, any kind of agent-level
dynamical rules can be imagined (such as the cross-topic
and reality-based consistency mechanisms described in Sec.
I), implemented and studied (at least numerically), since
agent-based modeling is highly versatile. The difficulty
consists in empirically validating such rules and tuning
the associated free parameters, since processes taking
place in the human mind are very hard to probe
experimentally.

Adding endogenous processes to exogenous dynamics models
in a realistic way would definitely make good use of input
from psychology and neuroscience, disciplines which are
much more focused on the individual human mind and brain,
respectively. Of course, by making use exclusively of
notions from these two disciplines, one might fall into the
other extreme, that of neglecting the exogenous aspect of
trait dynamics. This is why an interdisciplinary approach
is best suited for developing a general, complete theory of
trait dynamics and for understanding how preferences are
formed. This is something that was not attempted in this
study, although it is a meaningful and arguably ambitious
goal for future research. The purpose of this research was
to indirectly study the combination of endogenous and
exogenous processes taking place in the real world, by
looking at isolated snapshots of this dynamics - the
empirical cultural states.

One may wonder how important endogenous mecha¬ 
nisms actually are for trait dynamics, and whether they 
are worth mentioning at all. In a certain sense, this study 
suggests, together with Refs. [18. and Dll, that the en¬ 
dogenous contribution is relatively high: had the struc¬ 
ture present in empirical cultural states been dominated 


by the exogenous component, an exogenous dynamics model
would have given similar results when acting on uniformly
random initial conditions compared to empirical initial
conditions. This expectation is natural if one believes
that an empirical cultural state is somehow intermediate
between an initial and a final state of exogenous dynamics,
especially since uniform randomness, typically used for
generating the initial state, can be interpreted as a
complete lack of structure. But this scenario is obviously
invalid, since exogenous dynamics clearly discriminates
empirical cultural states from those generated in a
uniformly random way. Thus, the results of this work
confirm that empirical cultural states should be generally
regarded as snapshots of a joint, endogenous-exogenous
dynamical process. As discussed in Sec. V B, there is one
argument against this reasoning, having to do with the fact
that an empirical cultural state is obtained by sampling
individuals from a geographically distributed population.
However, based on previous results, one can argue that this
in itself would not account for the discrepancy between
empirical and random data.

It is worth noting that there are models of exogenous
dynamics with so-called “cultural drift” [7], which
involves arbitrary changes of traits irrespective of social
influence. This can be seen as a first step in the
direction of adding endogenous processes, if one interprets
cultural drift as a primitive model of the endogenous
component. However, cultural drift has been modeled as
uniformly random, which, based on current results,
apparently is not a good way of accounting for endogenous
processes. In order to model these processes in a realistic
way, more needs to be understood about how they operate in
the real world.


B. Empirical cultural states 


Very important for this study is the possibility of con¬ 
verting empirical data from social surveys into sets of cul¬ 
tural vectors. This is a problem of data formatting that 
is challenging in several ways (see Sec. 0 To a great 
extent, the task is to select a meaningful set of variables 
from those used in the survey that can be consistently 
interpreted as cultural features, in the sense of the Axel¬ 
rod model. Specifically, one is interested in variables that 
record attributes of individuals that can be conceivably 
altered by social influence. In addition to demographic 
variables, which should be obviously disregarded, there 
are certain variables that need to be discarded for a va¬ 
riety of reasons (see Sec. 0. even if many of them do 
encode culturally-relevant information. This amounts to 
an overall “loss of resolution” on cultural distances, which 
is not of much concern, as long as enough variables are 
left after this filtering: one can always argue that the 
survey could have contained more questions than it did, 
so there is always room for improving the values of the 
cultural distances. Indeed, the universal patterns shown 


in Sec. IV are visible despite the large variability in the
numbers of features constructed from the four data sets,
and they are robust to reducing the number of features
within one data set, which is explicitly shown in the case
of the Eurobarometer: the LTCD-STCB analysis of Fig. 4 is
carried out only using the nominal features, while that of
Fig. 2 is carried out with both nominal and ordinal
features, but leads to very similar results.

Once the formatting of the empirical data is complete,
the resulting cultural state is assumed to be informative
as a snapshot of the evolution of culture in the respective
region of the world. However, in addition to exogenous and
endogenous processes of trait dynamics, the empirical
cultural state also reflects other effects, having to do
with how the data is collected in the first place. The
sampling bias is one such effect: certain people are less
willing to participate in a social survey, or are less
reachable by the survey because of living in a remote area.
This bias somewhat alters the way individuals are
distributed in cultural space, because their recorded
preferences and opinions are likely correlated, to a
certain extent, with their willingness to participate in a
survey and with their place of residence. It is generally
difficult to estimate how large this bias is and to
compensate for it. Nonetheless, it is reasonable to expect
that the strength and nature of this bias depend on the
data set, because different surveys are technically
conducted in different ways, for different purposes and in
different geographical regions. The fact that the LTCD-STCB
plot is independent of the data set and of the geographical
regions (Fig. 4) suggests that the sampling bias is
negligible for the purpose of this analysis, and that the
results discussed here cannot be an artifact of it.

Moreover, each empirical cultural state also reflects the
fact that the set of selected individuals is a sparse
sub-sampling from a geographically distributed population
of a much larger size. It is unlikely (and sometimes
deliberately avoided through the survey design) that any of
these individuals significantly interact directly with each
other in the real world. However, when the exogenous
dynamics model used here is applied to the empirical
cultural state, one conceptually puts all these individuals
in an “isolated container”, where they can interact with
each other for a long period of time, while constantly
“mixing the content” of the container - since there is
effectively no spatial or social structure constraining the
way the agents in this model interact, while the
interacting pair of individuals is selected at random at
every moment in time.

This observation can be used as an argument for
minimizing the importance of endogenous processes: one can
argue that the structure inherent in an empirical cultural
state is just the result of exogenous processes taking
place in the real world, in combination with the sparse
sub-sampling. Following this reasoning, one would expect
that empirical-like structure can be obtained via the
following procedure: run an exogenous dynamics model with a
large population of agents that are geographically
distributed (in physical space); define a set of cultural
vectors by selecting a small fraction of random agents in
the final state; apply the LTCD-STCB analysis on this set
of cultural vectors, on its shuffled and on its random
counterparts, expecting that the universal behavior
emphasized in Sec. IV is visible.

The authors’ expectation is that this argument is not
valid: were it valid, one should also expect that the
respective, more realistic model of exogenous dynamics
would behave independently of the initial cultural state,
unlike the minimalistic one used here. But this expecta¬ 
tion already seems to be falsified, for small agent popu¬ 
lations, by the results of Ref. US], where such a realistic, 
Axelrod-type model is applied to empirical, shuffled and 
random initial cultural states, leading to different results 
for the three cases² - the respective version of the
Axelrod model, which is an adaptation of that developed in
Ref. [13], accounts for a physical, 2-dimensional space,
for movement of agents in this space and for a dynamic
social network, with appropriate couplings to the dynamics
in cultural space. Nonetheless, it is worth checking
explicitly, in future research, the validity of such an
argument, using the above procedure. If it does not work,
the same procedure can be employed for a-posteriori
validating models of trait dynamics with endogenous
ingredients added to the exogenous component - see Sec. V A.


The fact that other, more realistic versions of the
Axelrod model can be conceived apparently provides an
argument against part of the claimed meaning and
implications of present results: had the LTCD quantity been
computed using a different model, one that would account
for the physical space and for the social network, the
dependence of the LTCD quantity on the bounded-confidence
threshold $\omega$ would probably be different. This
difference would likely manifest through a different
position and width of the LTCD($\omega$) phase transition,
which for empirical data would likely no longer overlap so
well with the STCB($\omega$) phase transition. This
reasoning suggests that the criticality claim made in Sec.
IV is sensitive to details of the exogenous dynamics model
used for computing the LTCD quantity.

However, this argumentation is misleading. In the present
study, there is a certain parallelism between how the LTCD
and the STCB quantities are computed as functions of
$\omega$ (see Sec. III): they both condition social
influence only on the distance in cultural space, via the
$\omega$ threshold, while both disregarding the physical
space and the social network. If the latter ingredients are
to be included in the computation of LTCD, they should also
be included in that of STCB, for the purpose of comparing
the two phase transitions and for plotting the LTCD-STCB
curves. For a certain value of $\omega$, the presence of
physical space and social network would increase the number
of cultural domains in the final state of the Axelrod-type
model, thus increasing the LTCD value, but at the same time
would also increase the number of connected components in
the initial state, thus decreasing the STCB value - the
STCB quantity would essentially work with the connected
components in the intersected graph, where a link between
two agents is present when they are connected not only
culturally, but also physically and socially; even if there
is one connected component in each of the three graphs, one
can still have multiple connected components in the
intersected graph. Thus, it is reasonable to expect that
the two phase transitions would shift in the same way upon
adding model ingredients, if this is done consistently on
both sides, while retaining the shape of the LTCD-STCB
curve and justifying the criticality claim in a more
generic sense. Nonetheless, this is another aspect worth
checking explicitly in further research.

2 although using an analysis tool somewhat different from the
LTCD-STCB combination used here.
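
The remark about the intersected graph can be checked on a minimal, hypothetical example. The three edge lists below are invented for illustration and do not come from any of the data sets; each graph on its own is connected, yet no pair of agents is linked in all three at once.

```python
def count_components(n, edges):
    """Number of connected components of an undirected graph on nodes 0..n-1."""
    parent = list(range(n))

    def find(i):
        # Union-find root lookup with path halving.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i, j in edges:
        parent[find(i)] = find(j)
    return len({find(i) for i in range(n)})


def normalize(edges):
    """Store undirected edges as sorted pairs so that set operations work."""
    return {tuple(sorted(e)) for e in edges}


# Hypothetical cultural, physical and social links among three agents.
cultural = normalize([(0, 1), (1, 2)])
physical = normalize([(1, 2), (0, 2)])
social = normalize([(0, 2), (0, 1)])

# A link survives only if the two agents are connected in all three graphs.
intersected = cultural & physical & social

for name, edges in [("cultural", cultural), ("physical", physical),
                    ("social", social), ("intersected", intersected)]:
    print(name, count_components(3, edges))
```

Each individual graph has a single component, while the intersected graph falls apart into three isolated agents, which would lower an STCB value computed from it.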

Yet another objection can be raised. For any data set,
shuffling destroys correlations between cultural features.
Thus, one can argue that the difference between the
empirical and the shuffled regime of the LTCD-STCB analysis
may simply be due to the presence of feature-feature
correlations, which in turn are supposedly due to “design
details” of the social survey, having to do with certain
questions being similar to each other. Consequently, there
would be no need to think about dynamical mechanisms
responsible for the empirical structure, about integrating
endogenous and exogenous dynamical ingredients or about a
complete theory of cultural dynamics and preference
formation. However, the a-priori expectation is that
design-induced correlations are relatively weak: collecting
social survey data is expensive, so the survey should be
designed such that it captures as much as possible of the
relevant degrees of freedom, by minimizing the similarities
among questions. Moreover, remaining similarities should be
specific to each data set, whereas the LTCD-STCB analysis
gives highly similar results for different data sets (the
universality claim). To better illustrate this
counterargument, feature-feature correlations were measured
in Sec. IV and explicitly shown to be specific to each
social survey, which is compatible with the idea that they
largely depend on “design details”.³ In truth,
feature-feature correlations can be seen as one of several
manifestations of a non-uniform cultural space
distribution, which is certainly also affected by (arguably
unavoidable) a-priori, survey-dependent similarities
between features, but not in an essential way: the
universality of the results of the LTCD-STCB analysis
should be due to something universal about the shape of the
cultural space distribution, which in turn is likely due to
non-trivial, universal laws governing the dynamics of
culture in the real world. It is also worth noting that one
cannot say to what extent a correlation between two
features is caused by an a-priori similarity between the
two questions and to what extent it arises dynamically due
to the combined effect of endogenous and exogenous
processes taking place in the real world. One can even
argue that trying to disentangle the a-priori contribution
is entirely meaningless, partly because the questions
themselves are formulated by humans who interact with each
other and with society.

3 Interestingly, the largest level of correlation is visible for the
Jester (JS) data set, which is certainly the least expensive to
collect, since respondents provide their answers online, via an
automated platform. Moreover, the second-largest level of
correlation is present in the Religious Landscape (RL) data set,
which is arguably the second-least expensive to collect, since it
relies on telephone interviews, while the other two data sets rely
on face-to-face interviews.


VI. SUMMARY AND CONCLUSIONS 


This manuscript started by emphasizing the important
problem of trait formation. It argued that solving
this problem would be equivalent to formulating and val¬ 
idating a complete theory of trait dynamics, which would 
embrace the dynamical and multidimensional aspects of 
culture. Both these aspects are accounted for in the exist¬ 
ing cultural dynamics paradigm. However, this paradigm 
mostly focuses on social influence, thus on exogenous 
mechanisms, typically disregarding any endogenous ones. 
On the other hand, previous work had shown that the 
outcome of exogenous dynamics strongly depends on the 
cultural state used for the initial conditions. Specifically, 
there is more long-term cultural diversity (LTCD) for a 
given level of short-term collective behavior (STCB) for 
an empirical cultural state than for shuffled or random 
ones that are generated in the same cultural space. This 
provided a quantitative tool for highlighting the struc¬ 
ture in empirical sets of cultural vectors, while raising 
the question of whether results obtained with it are in¬ 
dependent of the empirical data set. 

The study focused on addressing this question. It
showed that the behavior of LTCD as a function of STCB
is strikingly similar across data sets, implying that
empirical cultural states have non-trivial, universal
properties, which apparently cannot be recognized in the
matrix of feature-feature correlations. Moreover, it pointed
out that the presence of both LTCD and STCB in
real-world systems implies that these systems function close
to criticality with respect to both aspects. One can
object to this claim (Sec. V B) by pointing out an apparent
sensitivity to the assumptions implicit in the formulation
of LTCD. However, the assumptions behind LTCD
are in this study paralleled by similar assumptions
behind STCB, so modifying the exact formulation of LTCD
should only be done while consistently modifying the
exact formulation of STCB. One can thus reasonably
expect that criticality would turn out to be robust if further
investigations in this direction are made.

The LTCD-STCB analysis makes clear that convergence
of cultural vectors under the action of exogenous
dynamics takes place differently when starting with
empirical data than when starting with random data. This
suggests (Sec. V A) that an empirical cultural state is not
an intermediate state between a random cultural state
and an absorbing state of an Axelrod-type model. In
other words, empirical structure is arguably not a
consequence of exogenous mechanisms alone: endogenous
mechanisms should have a significant contribution. An
objection against this claim can also be formulated (Sec. ):
one can argue that exogenous dynamics appears to
take place differently when starting with empirical data
because this is sampled from a very large, geographically
distributed population which had already been subject
to exogenous dynamics for a long time. If this were true,
there would be no need for endogenous processes when
explaining the structure in the empirical data. Although
this alternative explanation is worth testing in future
research, there already exists indirect evidence against it.

This study demonstrated that one can learn about
cultural dynamics from empirical data, even if this data
consists of single, isolated, partial snapshots of this
dynamics. Moreover, it emphasized the importance of
endogenous processes and of their interplay with exogenous
processes: including plausible endogenous processes, which
has very recently been attempted for the first time [16],
might be crucial for making existing dynamical models
of culture realistic. The LTCD-STCB plot used here can
conceivably be employed as a tool for validating such
models.


Appendix A: Empirical data formatting

This section explains various details concerning the for¬ 
matting of the empirical data. As previously mentioned, 
four data sets were employed, each of which was collected 
by different entities, for different purposes and in differ¬ 
ent formats. In order for the analysis and modeling con¬ 
ducted here to be carried out consistently, the important 
information had to be extracted from each data set and 
expressed in one, unified format. Essentially, this for¬ 
mat dictates that each data set has to provide a certain 
number of ordinal features and a certain number of nom¬ 
inal features, where each feature has a certain number 
of possible traits (the range q of the feature), and that 
the traits of every individual in the data set are recorded 
with respect to all these features. This unified format 
can be effectively thought of as a table of traits, where 
the rows correspond to the features and the columns
correspond to the individuals. There are various challenges
involved when converting the data into this format. It
is worth explaining first the challenges that are more
generic, namely those relevant for several data sets, and then
moving to the challenges specific to each data set.
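
The unified format described above can be illustrated with a minimal
sketch. The names `Feature` and `validate_trait_table`, as well as the
convention of coding traits as integers from 0 to q - 1, are assumptions
made here for illustration; they are not taken from the original
formatting scripts.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass(frozen=True)
class Feature:
    """One cultural feature (hypothetical container, for illustration)."""
    name: str   # label of the survey variable
    kind: str   # "ordinal" or "nominal"
    q: int      # range of the feature: number of possible traits


def validate_trait_table(features: Sequence[Feature],
                         table: List[List[int]]) -> int:
    """Check a table of traits and return the number of individuals.

    table[i][j] is the trait of individual j with respect to feature i,
    so rows correspond to features and columns to individuals.
    Traits are assumed to be coded as integers in 0 .. q - 1.
    """
    if len(table) != len(features):
        raise ValueError("one row is required per feature")
    n_individuals = len(table[0]) if table else 0
    for feature, row in zip(features, table):
        if feature.kind not in ("ordinal", "nominal"):
            raise ValueError(f"unknown feature kind: {feature.kind}")
        if len(row) != n_individuals:
            raise ValueError("every individual needs a trait for every feature")
        for trait in row:
            if not 0 <= trait < feature.q:
                raise ValueError(f"trait {trait} outside range of {feature.name}")
    return n_individuals
```

Storing the range q next to each feature keeps the information needed
later for the cultural distance and for the shuffling and randomization
procedures in one place.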

One of the difficulties consists in deciding, for each
variable, whether it should be used as a cultural feature
or not. The following is a (not entirely exhaustive) list
of types of variables which are worth mentioning in this
regard:

• demographic variables, such as those encoding
“age”, “place of residence” or “ethnicity”, are
discarded, as they do not record subjective human
traits;

• certain variables that were not seen as demographic
variables by the survey authors are also
discarded if they record information about something
that lies too far in the respondent’s past, or
about something that cannot be easily related to
subjective preferences, opinions, values, beliefs or
behavioral tendencies that can conceivably be
altered via social influence in a reasonably easy way;
often, the boundary between what is subjective and
what is objective is not clear; nonetheless, one can
strive to make these decisions consistently at the
level of every data set, which is what was done here;

• there are questions that ask for opinions with respect
to something that is differently defined for different
people in the survey, such as: “how satisfied are you
with how the economy of this country is
going recently?” - if there are people from different
countries in the data set, or “how satisfied are you
with your life?”; these questions are also discarded;

• questions asking the respondent to self-evaluate a
certain personal trait, such as “would you say
about yourself that you are more conservative or
liberal on political affairs”, are retained, assuming
that the respondent mostly self-evaluates, in a
reasonably objective way, a personal (subjective) trait,
rather than expressing a subjective opinion about
the personal trait;

• certain variables containing relevant information
are also discarded if, due to the survey format,
they can only be answered when certain answers
are given to other variables, or if the set of possible
answers explicitly depends on answers given to
other questions, regardless of whether these “other”
variables themselves are selected or not; including
such variables would introduce inconsistencies in
the encoding of cultural vectors, the definition of
cultural distance and the shuffling and randomization
procedures.

The variables that are retained for further analysis
need to be encoded either as nominal or ordinal cultural
features. Deciding between the two encoding options was
done here using the following criterion: if there are more
than two possible answers that are not “neutral” (see
next paragraph) and they can all be conceivably ordered
along the real axis, then the variable is encoded as
ordinal; if, instead, there are only two answers (typically
“Yes” and “No”) in addition to the neutral ones, or if
the non-neutral answers cannot be ordered along the real
axis in a consistent way, then the variable is encoded as
nominal.
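
The encoding criterion above can be written as a short decision rule.
This is only a sketch: the function name `choose_encoding` is
hypothetical, and whether the answers "can be ordered along the real
axis" remains a human judgment that is passed in here as a boolean.

```python
def choose_encoding(n_non_neutral: int, orderable: bool) -> str:
    """Return "ordinal" or "nominal" for a retained survey variable.

    n_non_neutral: number of possible answers that are not "neutral".
    orderable: True if all non-neutral answers can be consistently
               ordered along the real axis (a judgment made by hand).
    """
    if n_non_neutral < 2:
        # Not covered by the criterion in the text; treated here as an
        # error (an assumption of this sketch).
        raise ValueError("at least two non-neutral answers are expected")
    if n_non_neutral > 2 and orderable:
        return "ordinal"
    # Only two non-neutral answers, or no consistent ordering.
    return "nominal"
```

For instance, a five-point agreement scale would be encoded as ordinal,
while a "Yes"/"No" question would be encoded as nominal even though its
two answers can trivially be ordered.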

Most variables retained from the data sets also allow
for one or more “neutral” answers (often called “missing
values” in social science research, although this term
usually is somewhat more general). These are usually labeled
as “Don’t know”, “Refused” or “Not Answered”. For
further analysis, these neutral answers are merged (if more
than one is present). If the variable is to be encoded as
nominal, neutral answers are mapped to one additional
cultural trait, side-by-side with traits originating from
non-neutral answers. If the variable is to be encoded as
ordinal, they are mapped to the middle of the ordinal
scale; if there is an even number of possible answers, then,
for each person, the choice is randomly made between the
two answers closest to the middle of the scale.
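
The treatment of neutral answers can be sketched as follows. The
conventions used here are assumptions made for illustration: non-neutral
answers are coded as integers from 0 to q - 1, the merged neutral answer
is represented by `None`, and the function names are hypothetical.

```python
import random
from typing import Optional


def encode_nominal(answer: Optional[int], q: int) -> int:
    """Nominal variable with q non-neutral answers.

    The merged neutral answer becomes one additional trait (index q),
    so the resulting feature has range q + 1.
    """
    return q if answer is None else answer


def encode_ordinal(answer: Optional[int], q: int, rng: random.Random) -> int:
    """Ordinal variable with q non-neutral answers.

    The merged neutral answer is mapped to the middle of the scale.
    For even q there is no middle answer, so one of the two answers
    closest to the middle is drawn at random for each person.
    """
    if answer is not None:
        return answer
    if q % 2 == 1:
        return q // 2
    return rng.choice((q // 2 - 1, q // 2))
```

Passing an explicit `random.Random` instance makes the random
assignment for even scales reproducible, for example with
`encode_ordinal(None, 4, random.Random(0))`.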

Note that some data sets (GSS and EBM below) for¬ 
mally allow for another type of answer, labeled as “IAP” 
or “INAP” (inapplicable), which is here regarded as sep¬ 
arate from neutral answers (although in social science 
research they are often all placed under the “missing val¬ 
ues” umbrella term). IAP values are recorded, for certain 
respondents, when answers to a specific question are not 
expected from those respondents, for reasons having to 
do with the design of the survey. This happens for
questions that are only asked conditionally on answers given
before. However, as mentioned above, these conditional
variables are discarded anyway. Similarly, IAP values
are also recorded for questions that are only asked to 
a certain sub-sample of the people, although not being 
conditional on some other question, in which case those 
questions are either removed or, if the sub-sample is large 
enough, the formatting is restricted to it. Finally, IAP 
values are also recorded for split-ballot or split-form vari¬ 
ables (see GSS and EBM explanations below), in which 
case specific procedures are followed, which effectively 
discard all IAP answers before further analysis. Thus, 
regardless of how exactly they occur, one does not need 
to map IAP answers to any trait, as they are all filtered 
out as a consequence of other formatting rules. Note that
for the RL data set, although IAP answers are not
explicitly mentioned anywhere, they could in principle have
occurred, since there are questions that are asked
conditionally on other questions; instead of IAP answers,
system-missing values are present in the SPSS file,
typically marked by the dot character.

First, this study made use of the Jester 2 (JS) data
set [35], which consists of online ratings of jokes
collected between November 2006 and May 2009. There are
around 1.7 million continuous ratings (on a scale from
-10.00 to +10.00) of 150 jokes from 59,132 users. For
most users, however, only 128 of the 150 jokes are
provided as items to be rated, as the other 22 were
eliminated at a certain point in time. For this study, each of
the 128 items is converted into an ordinal feature with
7 traits (by splitting the [-10, 10] interval into 7 bins of
equal size, while assuming that everything falling within
one bin constitutes the same answer). Moreover, only
the 2916 users that had rated all items were retained for
further analysis - although this introduces some bias in
the sample, one can argue that it is desirable to focus
on individuals that have rated everything, as this is an
indication of commitment on the respondent’s side.
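
The conversion of a continuous Jester rating into one of 7 ordinal
traits can be sketched as below. The function name `rating_to_trait` is
hypothetical, and the handling of bin edges (left-closed bins, with the
upper end of the scale assigned to the last bin) is an assumption, since
the text only specifies 7 bins of equal size.

```python
def rating_to_trait(rating: float, low: float = -10.0,
                    high: float = 10.0, n_bins: int = 7) -> int:
    """Map a continuous rating in [low, high] to a trait in 0 .. n_bins - 1."""
    if not low <= rating <= high:
        raise ValueError("rating outside the allowed interval")
    width = (high - low) / n_bins
    # The quotient is non-negative, so int() acts as a floor here.
    index = int((rating - low) / width)
    # The upper end of the scale belongs to the last bin (assumption).
    return min(index, n_bins - 1)
```

With these conventions, a rating of 0.0 falls in the middle bin (trait
3), while -10.0 and +10.0 map to traits 0 and 6 respectively.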

Second, the research used the Religious Landscape
(RL) data set [33], which consists of opinions and
attitudes on various religious topics, but also on various
political and social issues. This data was collected in 2007
via telephone interviews from all states of the USA - this
study only used the data obtained from the continental
part of the USA (without Hawaii and Alaska). There
are multiple questions asking about the religious
affiliation of respondents, which were all discarded. This is
partly based on the assumption that religious affiliation
is closer to a demographic variable than to a feature that
can be easily altered via social influence, partly based on
the very large number of answers and the nested,
hierarchical nature of how they are organized. For this study,
36 cultural features were constructed (18 nominal and 18
ordinal), for a number of 35558 respondents.

Third, the research used the Eurobarometer 38.1 
(EBM) data set [32], which consists of opinions on sci¬ 
ence, technology, environment and various EU political 
issues (mainly related to the open market and the econ¬ 
omy). The data was collected during November 1992, 
from 12 countries of the EU, via face-to-face interviews. 
In this survey, there are several blocks of “coupled”
variables which are all discarded: within each block, there are
explicit internal constraints on how answers can be given
(such as answering “yes” to at most 3 questions out of
8 that are available), which do not allow for a consistent
encoding as a set of nominal or ordinal features.

Another challenge when formatting the EBM data set 
is posed by the split-ballot procedure: the sample of peo¬ 
ple is split into 2 ballots, and certain questions are asked 
in slightly different versions (small differences in formu¬ 
lation, answers listed in different orders etc.) to the two 
ballots, while both versions are present in the SPSS file 
for all individuals - for every respondent, an IAP answer 
is recorded for the version that is not used for that re¬ 
spondent. The most meaningful approach is to merge 
the two versions and eliminate all IAP answers - if both 
versions are kept, strong structural artifacts arise in the 
matrix of cultural distances m- Most of the split ballot 
variables are encoded as ordinal and have the same range 
(same number of non-neutral answers) in both versions, 
such that a one-to-one correspondence can be made,
similarly to Ref. [19]. Some of them are still ordinal but have
different ranges in the two versions. In all these cases, 
there is a difference of only one trait between the two
versions, such that one range is an even number while the
other is odd. In this case, the odd version is kept for
the merging, which guarantees the existence of a middle
trait to which all neutral answers can be directly
assigned. The non-neutral answers from the even version
are mapped to the closest answers in the odd version, in 
terms of the distance from the lowest-value answer, as¬ 
suming that the distance between the lowest-value and 
highest-value answers is the same in the two versions 
(consistent with the definition of cultural distance in Eq. 
0 ). There is one split ballot variable which is encoded 
as nominal, in which case the difference consists in a sec¬ 
ond question being asked for one of the ballots, which is 
simply discarded. After all the formatting, 144 cultural 
features are constructed from this data set (54 nominal 
and 90 ordinal), for a number of 13026 respondents. 
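
The merging of two ordinal split-ballot versions whose ranges differ by
one trait can be sketched as follows. The function name
`map_even_to_odd` and the coding of answers as integers starting from 0
(the lowest-value answer) are assumptions made for illustration. Both
scales are rescaled to unit length, so that the distance between the
lowest-value and highest-value answers is the same in the two versions,
and each answer of the even version is assigned to the closest answer of
the odd version.

```python
from fractions import Fraction
from typing import List


def map_even_to_odd(q_even: int, q_odd: int) -> List[int]:
    """Return, for each answer of the even version, the index of the
    closest answer in the odd version (both counted from the lowest)."""
    if q_even % 2 != 0 or q_odd % 2 != 1 or abs(q_even - q_odd) != 1:
        raise ValueError("expected an even and an odd range differing by one")
    if q_odd < 3:
        raise ValueError("the odd version needs at least three answers")
    mapping = []
    for k in range(q_even):
        # Position of answer k on a scale of unit length.
        position = Fraction(k, q_even - 1)
        # Exact rational arithmetic avoids floating-point rounding issues.
        nearest = min(range(q_odd),
                      key=lambda j: abs(Fraction(j, q_odd - 1) - position))
        mapping.append(nearest)
    return mapping
```

For example, `map_even_to_odd(4, 5)` leaves the middle trait of the
five-point version unused by non-neutral answers, so that it remains
available for the neutral ones. Exact ties cannot occur in this setting:
a tie would require 2k(q_odd - 1) to equal an odd multiple of
(q_even - 1), which is itself odd, while the left-hand side is even.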

Fourth, the study used the General Social Survey 
(GSS) data [33], collected during 1993 in the USA via 
face-to-face interviews. The overall scheme of how ques¬ 
tions are asked to respondents is arguably more compli¬ 
cated than for the EBM data set. First, there is a split- 
form procedure involved, which is equivalent to what is 
called “split-ballot” in the case of EBM: the respondents 
are split into two groups, with certain questions being 
asked in two, slightly different versions. All these ques¬ 
tions are ordinal and have the same ranges in the two 
forms; they are handled as in the case of EBM.
Independently of the split-form procedure, there is another
procedure called “split-ballot”, which is methodologically
somewhat different: the sample of respondents is split into
3 ballots (A, B, C), while some questions are only asked to
2 of the 3 ballots (A and B, B and C or A and C). This
is handled by discarding the questions asked to only 2 of
the 3 ballots. Independently of the split-ballot and
split-form procedures, there is a set of questions, also used
within the International Social Survey Program (ISSP),
which are not asked to a small fraction of respondents (49
out of 1608 respondents). This is handled by discarding
the 49 people not exposed to the ISSP questions. All in
all, 133 cultural features are constructed from the GSS
data (8 nominal and 125 ordinal), for a number of 1559
respondents.


Appendix B: Feature-feature correlations 

This section illustrates in detail the correlations
between cultural features, computed according to Eq. (3).
The feature-feature correlation matrices of the four
empirical SCVs are shown in Fig. 6, while those of the four
shuffled counterparts are shown in Fig. 7. The ordering
of rows and columns is consistent with the actual ordering
of questions in the four data sets. This leads to a
partial block-diagonal aspect of the matrices associated
with the Eurobarometer and Religious Landscape data sets,
for which questions that deal with similar topics tend to
appear next to each other. Note that empirical
correlations rarely show strong deviations from their shuffled
counterparts. There is a clear discrepancy between the 
Eurobarometer correlation matrix shown here and that 
shown in the Supplementary Information of Ref. T81 . 
However, the current study used a different, much more 
rigorous procedure of formatting the empirical data. 